Using decision trees to construct optimal acoustic cues
نویسندگان
چکیده
This paper presents an approach to the optimization of acoustic cues used for stop identi cation in the context of an acoustic-phonetic decoding system which uses automatic acoustic event extractors (a formant tracking algorithm and a burst analyzer). The acoustic cues have been designed on the basis of acoustic studies on stops and spectrogram reading experiments. This ensures that these cues have a certain amount of discriminating power but we do not know either the optimal thresholds nor which combination of cues are the most e cient. Therefore, we propose to use the decision tree theory [4] to choose the most discriminating cues and to improve their discrimination power. Considering the stop occurrences of a training corpus, the best cues are those which allow the decision tree leading to the best partition to be constructed. We have considered all the cues derived from the ones provided by the phonetician on formant transitions and burst characteristics. The improvement of the cues has been achieved on a corpus of 941 stops.
منابع مشابه
Machine learning techniques for the identification of cues for stop place
This paper is situated in a long line of phonetic studies that seek to determine and qualify the acoustic cues humans use to identify stop place. The present study draws from a database of 1500 CV tokens of American English and their values for the acoustic features thought to be cues for stop place identification, including (1) VOT, (2) energy of the burst and release, (3) spectrum at the burs...
متن کاملUsing Robust Decision Tree Construction for Continuous Speech Recognition
Context-dependent units using decision tree have been broadly used to model the co-articulation effects and the speech variation in speech recognition. Decision trees are generally constructed in a data driven way and guided by linguistic information that contains a priori phonetic knowledge. In this paper, a two-stage splitting criterion is proposed to effectively construct the decision trees....
متن کاملAn empirical analysis of user uncertainty in problem-solving child-machine interactions
With the widespread use of technologies directed towards children, child-machine interactions have become a topic of great interest. Computers must interpret relevant contextual user cues in order to provide a more natural interactive environment. Our focus in this paper is analyzing audio-visual user uncertainty cues using spontaneous conversations between a child and computer in a problem-sol...
متن کاملSome acoustic cues to human and machine estimation of speaker age
Two experiments were carried out in order to learn more about the relation between the various acoustic cues to speaker age. The first included listening tests with resynthesized stimuli, and the second comprised automatic estimation of age using the CART (Classification And Regression Trees) technique. In the first experiment, results indicate that human listeners seem to rely more on spectral...
متن کاملOptimal Generalized Decision Trees via Integer Programming
Decision trees have been a very popular class of predictive models for decades due to their interpretability and good performance on categorical features. However, they are not always robust and tend to overfit the data. Additionally, if allowed to grow large, they lose interpretability. In this paper, we present a novel mixed integer programming formulation to construct optimal decision trees ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996